The use of sub-band cepstrum in speaker verification
نویسندگان
چکیده
This paper focuses on the spectral representation of the sub-band cepstrum in relation to that of the full-band cepstrum. Through theoretical analysis it is shown that the net spectral information content of the cepstral coefficients with the same index in different sub-bands is only comparable to that of a full-band cepstral parameter whose quefrency is given by the product of that specific index with the number of sub-bands. A new method is proposed to tackle this deficiency of the sub-band cepstrum when it is used in the context of text-dependent speaker verification. The experimental investigations have clearly demonstrated the effectiveness of this method in speaker verification.
منابع مشابه
Sub-band based speaker verification using dynamic recombination weights
This paper describes a new method for generating the recombination weights in sub-band based speaker verification. The approach, which is based on the use of background speaker models, attempts to reduce the effect of any mismatch between the band-limited segments of the test utterance and the corresponding sections in the target speaker model. The discussion also includes an analysis of other ...
متن کاملRobust speaker recognition using spectro-temporal autoregressive models
Speaker recognition in noisy environments is challenging when there is a mis-match in the data used for enrollment and verification. In this paper, we propose a robust feature extraction scheme based on spectro-temporal modulation filtering using two-dimensional (2-D) autoregressive (AR) models. The first step is the AR modeling of the sub-band temporal envelopes by the application of the linea...
متن کاملRobustness to additive noise of locally-normalized cepstral coefficients in speaker verification
In this paper the performance of a new feature set, Locally Normalized Cepstral Coefficients (LNCC) is evaluated for a speaker verification task with short testing utterances in additive noise. The results presented here show that LNCC outperforms baseline MFCC features when SNR is lower than 15 dB. The average relative reduction in EER achieved by LNCC is 33%. The use of LNCC in combination wi...
متن کاملSub-band based text-dependent speaker verification
, (p s c S p th cepstral coefficient of the s th sub-bands { c 1 (1,p) = c(p) is the p th full-band cepstral parameter} S number of sub-bands Y(k) k th log spectral magnitude K number of log spectral magnitudes) (k Y ′ ′ k th log-energy outputs of the mel-scale filterbank K ′ ′ number of log-energy outputs of the mel-scale filterbank h t weight associated with the t th segment U number of compe...
متن کاملImproved Data Modeling for Text-Dependent Speaker Recognition Using Sub-Band Processing
A growing body of recent work documents the potential benefits of sub-band processing over wideband processing in automatic speech recognition and, less usually, speaker recognition. It is often found that the subband approach delivers performance improvements (especially in the presence of noise), but not always so. This raises the question of precisely when and how sub-band processing might b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000